Name | Version | Summary | date |
phasellm |
0.0.22 |
Wrappers for common large language models (LLMs) with support for evaluation. |
2024-05-18 22:48:40 |
agenta |
0.14.10 |
The SDK for agenta is an open-source LLMOps platform. |
2024-05-18 12:42:43 |
torcheval-nightly |
2024.5.18 |
A library for providing a simple interface to create new metrics and an easy-to-use toolkit for metric computations and checkpointing. |
2024-05-18 12:07:41 |
dyff |
0.19.0 |
Meta-package to install the local SDK for the Dyff AI auditing platform. |
2024-05-17 19:35:53 |
dyff-audit |
0.3.4 |
Audit tools for the Dyff AI auditing platform. |
2024-05-17 05:22:52 |
dyff-client |
0.6.0 |
Python client for the Dyff AI auditing platform. |
2024-05-17 04:26:22 |
dyff-schema |
0.6.0 |
Data models for the Dyff AI auditing platform. |
2024-05-16 21:20:41 |
AutoRAG |
0.1.12 |
Automatically Evaluate RAG pipelines with your own data. Find optimal structure for new RAG product. |
2024-05-16 12:31:36 |
langsmith |
0.1.59 |
Client library to connect to the LangSmith LLM Tracing and Evaluation Platform. |
2024-05-16 01:56:09 |
codebleu |
0.6.1 |
Unofficial CodeBLEU implementation that supports Linux, MacOS and Windows available on PyPI. |
2024-05-14 21:29:39 |
uptrain |
0.7.1 |
UpTrain - tool to evaluate LLM applications on aspects like factual accuracy, response quality, retrieval quality, tonality, etc. |
2024-05-14 09:19:40 |
redlite |
0.2.0 |
LLM testing on steroids |
2024-05-10 17:31:30 |
promptmodel |
0.1.19 |
Prompt & model versioning on the cloud, built for developers. |
2024-05-10 02:36:18 |
evo |
1.28.0 |
Python package for the evaluation of odometry and SLAM |
2024-05-09 10:33:54 |
langcheck |
0.7.1 |
Simple, Pythonic building blocks to evaluate LLM-based applications |
2024-05-08 14:45:03 |
trajectopy |
2.0.14 |
Trajectory Evaluation in Python |
2024-05-08 10:36:42 |
trajectopy-core |
3.1.0 |
Trajectory Evaluation in Python |
2024-05-08 10:34:26 |
dinglehopper |
0.9.6 |
The OCR evaluation tool |
2024-05-06 15:51:57 |
ragrank |
0.0.7 |
An evaluation library for RAG models |
2024-05-05 14:07:20 |
mlrl-testbed |
0.10.0 |
Provides utilities for the training and evaluation of multi-label rule learning algorithms |
2024-05-05 00:06:13 |